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The ratio of the cross section for pp interactions producing a Z boson and at least one b quark jet 
to the inclusive Z +jet cross section is measured using 4.2 fb _1 of pp collisions collected with the DO 
detector at the Fermilab Tevatron collider at -*/s — 1.96 TeV. The Z — > £ + £~ candidate events with 
at least one b jet are discriminated from Z+ charm and light jet(s) events by a novel technique that 
exploits the properties of the tracks associated to the jet. The measured ratio is 0.0193 ± 0.0027 for 
events having a jet with transverse momentum pr > 20 GeV and pseudorapidity |?j| < 2.5, which is 
the most precise to date and is consistent with theoretical predictions. 

PACS numbers: 12.38.Qk, 13.85.Qk, 14.65.Fy, 14.70.Hp 



The measurement of the production cross section for a 
Z boson in association with b jets provides an important 
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test of perturbative quantum chromodynamics (QCD) 
predictions [lj . A good description of this process by the- 
oretical calculations is essential since it is a major back- 
ground to searches for the standard model (SM) Higgs 
boson via ZH(H — > bb) associated production 0] and for 
the supersymmetric partners of b quarks [3J. This pro- 
cess is also sensitive to the b quark density in the proton 
needed to predict phenomena such as single top quark 
production [3] and production of non-SM Higgs bosons 
in association with b quarks [5]. Calculations for the Z 
boson production in association with b quarks in pp colli- 



4 



sions are available at next-to-leading order (NLO) using 
two different approaches d i], and they agree within 
their respective theoretical uncertainties. 

In this Letter, we describe a measurement of the ra- 
tio of the inclusive cross sections for Z boson production 
with at least one b quark jet to the Z + jet(s) produc- 
tion in pp interactions, where the Z boson is identified 
via its Z —> ee and Z — > p,p decay modes. The Z + 6 jet 
events are separated from Z boson production with light 
(it, d, or s quarks, or gluons) and charm (c) jet(s) by 
a discriminant that exploits the properties of the tracks 
associated to the jet. The measurement of the ratio ben- 
efits from cancellations of many systematic uncertainties 
on the cross sections and therefore allows a more pre- 
cise comparison with theoretical calculations. Previous 
measurements by the DO Q and CDF Q collaborations 
agree with the SM predictions. Here, we present the most 
precise measurement of the ratio to date. This measure- 
ment is a significant improvement over the previous DO 
result which utilized 0.18 fb^ 1 of integrated luminos- 
ity and assumed the ratio of the Z + b jet cross section to 
the Z + c jet cross section from NLO calculations. This 
analysis uses a much larger dataset and a substantially 
improved method to extract the different jet flavor frac- 
tions. 

We use data from pp collisions at a center-of-mass en- 
ergy of 1.96 TeV collected by the DO detector @ at the 
Fermilab Tevatron between 2006 and 2009 and corre- 
sponding to an integrated luminosity of 4.2 fb -1 . The 
selected events are required to pass at least one of the 
single electron or single muon triggers. The efficiency 
of the triggers, as measured from data, is close to 100% 
(78%) for the Z ->• ee [Z -4 fifi) final state. 

This analysis relies on all components of the detector: 
tracking, calorimetry, and muon system and the ability to 
identify detached vertices. The DO detector consists of a 
central tracking system, comprising a silicon microstrip 
tracker (SMT) and a central fiber tracker, both within 
a 2 T solenoidal magnet; a liquid- argon and uranium 
calorimeter, divided into a central calorimeter and two 
endcap calorimeters; and a muon system, consisting of 
three layers of tracking detectors and scintillation trig- 
ger counters. The SMT allows a precise reconstruction 
of the pp interaction vertex (PV) and of eventual sec- 
ondary vertices (SV), and an accurate determination of 
the impact parameter (IP) of a track relative to the PV, 
which are the key components of the jet lifetime-based 
^-tagging algorithms. 

Offline event selection requires a reconstructed PV that 
has at least three associated tracks and is located within 
60 cm of the center of the detector in the coordinate along 
the beam direction. The selected events must contain a Z 
boson candidate with a dilepton invariant mass 70 GeV < 
mu < 110 GeV. Throughout this Letter we use Z boson 
to denote any dilepton event in the above mentioned mass 
range due to Z or 7* production. 

The dielectron (ee) selection requires at least two elec- 
trons of transverse momentum px > 15 GeV identified by 



electromagnetic (EM) showers in the central (with pseu- 
dorapidity [To[ \rj\ < 1-1) or endcap (1.5 < \r)\ < 2.5) 
calorimeter. The showers must have a significant frac- 
tion of their energy deposited in the EM calorimeter, be 
isolated from other energy depositions, and have a shape 
consistent with that expected for an electron. The cen- 
tral electrons, in addition, must match central tracks or 
produce electron-like patterns of hits in the tracker. 

The dimuon (pfi) selection requires at least two muons 
with segments in the muon spectrometer matched to cen- 
tral tracks with pr > 10 GeV and |ry| < 2. Combined 
tracking and calorimeter isolation requirements are ap- 
plied to the muon candidates. Muons from cosmic rays 
are rejected by applying a timing criterion to the hits in 
the scintillator layers as well as restricting the position of 
the muon track with respect to the PV. The two muons 
must also have opposite electric charges. 

A total of 411,064 (224,814) Z boson candidate events 
are retained in the ee (/i^t) channel. The Z + jet sam- 
ple is then selected by requiring the presence of at least 
one reconstructed jet with < 2.5, with the leading 
jet having pr > 20 GeV and any additional jets hav- 
ing pt > 15 GeV. Jets are reconstructed from energy 
deposits in the calorimeter using the iterative midpoint 
cone algorithm [11] with a cone of radius 0.5. The energy 
of jets is corrected for detector response, the presence of 
noise and multiple pp interactions, and the energy de- 
posited outside of the jet cone used for reconstruction. 
Jets containing b quarks have a different energy response 
and receive an average additional energy correction of 
about 6% as determined from simulations. Events with 
missing transverse energy larger than 60 GeV are rejected 
to suppress the background from it production. These se- 
lection criteria yield a sample of 48,956 (24,450) Z+jet 
events in the ee (/x/i) channel. 

Jets considered for 6-tagging are subject to a prese- 
lection, called taggability, to decouple the intrinsic b jet 
tagging algorithm performance from other effects. For 
this purpose, the jet is required to have at least two as- 
sociated tracks with px > 0.5 GeV, the leading track 
must have px > 1.0 GeV, and each track must have at 
least one SMT hit. This requirement has a typical effi- 
ciency of 90% per jet. The jet related efficiencies men- 
tioned here and later on are determined from simulations 
and corrected for the difference observed in data. In or- 
der to enrich a sample with heavy-flavor jets, a neural 
network (NN) based 6-tagging algorithm is applied that 
exploits the longer lifetimes of 6-flavored hadrons in com- 
parison to their lighter counterparts [Hj]. The inputs to 
the NN combine several characteristic quantities of the 
jet and associated tracks to provide a continuous output 
value that tends towards one for b jets and zero for non-6 
jets. The important input variables are the number of re- 
constructed SV in the jet, the invariant mass of charged 
particles associated with the SV (Msv), the number of 
tracks used to reconstruct the SV, the two-dimensional 
decay length significance of the SV in the plane transverse 
to the beam, a weighted combination of the tracks trans- 
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FIG. 1: The probability densities of the discri- 
minant for b, c and light jets passing the NN b tagging 
requirement. Also shown is the distribution for the neg- 
ative tagged (NT) jets in data, described in the text. 



verse IP significances, and the probability that the tracks 
from the jet originate from the PV, which is referred to 
as the Jet Lifetime Probability (JLIP). We require at 
least one of the jets in the event to have a NN output 
greater than 0.5. In the case where the leading jet is not 
tagged, we apply the NN selection to sub-leading jets. A 
total of 2,200 (1,015) events with at least one b tagged jet 
candidate are thus selected in the ee (////) channel. The 
tagging efficiency for b jets and the mistagging rate of 
light jets are parametrized as functions of jet px and m 
and are about 58% and 2%, respectively, averaged over 
the kinematics of jets considered in this analysis. 

To further separate b jets from c and light jets, we con- 
struct a discriminant (Djjjj>) from the combination of 
M sv and JLIP, = (M S v/10 GeV - ln(JLIP)/40). 

The relative weights of the variables are selected based on 
studies of simulated data to maximize rejection of c and 
light quark jets. The mass Msv provides good discrim- 
ination between b, c, and light jets due to the different 
masses of the quarks. Jets from b quarks usually have 
large values of — ln(JLIP), while light jets mostly have 
small values, as their tracks originate from the PV. The 
average efficiency for the b jets in data to have a well- 
defined Ujlip output is about 68%, which is due to the 
finite efficiency for a b jet to have a reconstructed SV. 
Figure [T] shows the normalized distributions of Dj^p for 
jets of different flavors after the NN b tagging require- 
ment. The discriminant -Djlip separates well between &, 
c, and light jets. Figure Q] also shows the -Djlip distribu- 
tion of the tagged jets derived from a light jet enriched 
data sample, referred to as negatively tagged (NT) data. 
NT jets have negative values for some of the inputs for 
the NN algorithm such as decay length significance 
and IP which are caused by the detector resolution ef- 



fects. We estimate the b jet contamination in the NT 
data using a maximum likelihood fit and subtract its con- 
tribution. The template shapes in the corrected NT data 
and the light jets in Monte Carlo (MC) simulation look 
similar and the small difference is taken as a systematic 
uncertainty. 

The dominant background to Z + jet production 
arises from multijet (MJ) events in which jets are mis- 
reconstructed as leptons, especially in the ee channel. 
This instrumental background is estimated from data. 
We use MJ-enriched data samples that pass all event se- 
lection requirements, but fail some of the lepton quality 
criteria, to determine the kinematic shape of the back- 
ground distribution. For the ee channel, the MJ sample 
is obtained by inverting the shower shape requirements 
and relaxing other identification criteria on the electron 
candidates. For the fifi channel, the MJ sample consists 
of events with muon candidates that fail the isolation 
criteria. 

Smaller background contributions arise from top quark 
pair (tt) and diboson (WW, WZ, ZZ) production, 
which contain two leptons in the final state. These 
backgrounds are estimated using MC simulations with 
the cross sections rescaled to match theoretical calcula- 
tions [HI [3] ■ We simulate inclusive diboson production 
with PYTHIA [lij]. Events from Z+]et and tt processes are 
generated with ALPGEN [l6| , interfaced with pythia for 
initial and final state radiation and for hadronization. 
For these events, a matching procedure is used to avoid 
double counting of partons produced by ALPGEN and 
those subsequently added by the showering in pythia. 
The Z+jets samples consist of Z+light jets and a Z+ 
heavy-flavor component, which includes Z + bbfcc) pro- 
duction. All simulations use the CTEQ6L1 pjj parton 
distribution functions (PDFs) . All samples are processed 
using a detector simulation based on GEANT3 [l8| and 
the same offline reconstruction algorithms as for data. 
Events from randomly chosen beam crossings are over- 
laid on the simulated events to reproduce the effect of 
multiple pp interactions and detector noise. The normal- 
izations of the simulated and the MJ backgrounds are 
adjusted by scale factors determined from a fit to the 
mu distributions in the inclusive untagged sample. The 
background fraction in the ee channel is about 18% for 
both the inclusive untagged and tagged samples, and is 
dominated by the MJ background. The fifi channel has a 
higher purity, with a background fraction of only about 
0.8% in the untagged and tagged samples. 

Corrections are applied to the simulated events to im- 
prove the MC modeling. The simulated Z — > fifi events 
are weighted with trigger efficiencies measured in data. 
For the ee channel, no correction is applied as the corre- 
sponding trigger is nearly 100% efficient. Lepton identi- 
fication efficiencies are corrected as a function of 77, az- 
imuthal angle 4>, and the z position of the PV. Jet en- 
ergies are smeared to reproduce the resolution observed 
in data, and the efficiency for reconstructing a jet is cor- 
rected to match the one in data. The simulated Z boson 
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FIG. 2: (color online) The observed pr distribution of 
the leading jet in the (a) and (b) ee channel comp- 
ared with the SM prediction. The uncertainties on the 
data points are statistical, and the prediction is normal- 
ized to the data, as described in the text. 



events are reweighted such that the pt distribution of 
the Z boson is consistent with the observed distribution. 
Figures Hta,b) show the pr distribution of the leading jet 
in data compared with the expectation from simulation 
for Z+jets inclusive events and the associated contribu- 
tions in each channel. The dominant contribution comes 
from Z+light jet production. 

In order to measure the fraction of events with differ- 
ent jet flavors in the final selected sample, we perform 
a binned maximum likelihood fit to the -D^p distribu- 
tion in data using a combination of the light, c, and b 
flavor jet templates. Before the fit, we subtract the non- 
(Z+ jet) background contributions. A total of 970 (630) 
events remains in the ee (/x/x) channel passing all selec- 
tion requirements and after the background subtraction. 
The b and c jet -Djlip templates are taken from MC sim- 
ulations with correction factors applied to account for 
the differences in data and MC efficiencies. The light jet 
template is obtained from the higher statistics NT data 
described earlier. The jet flavor fractions obtained in the 
ee and \x\i channels are shown in Table U where the un- 
certainties are from the fit due to the data and template 
statistics. The relative light and c quark fractions are 
not tightly constrained by the data. The b jet fraction is, 
however, largely insensitive to variations in the relative 
amount of light and c jets. Since the individual samples 
yield consistent results, we combine the ee and \x\x sam- 
ples and remeasure the fractions using an independent fit. 
The -D^jp distributions in the two data samples used for 
fitting agree after background subtraction. The last col- 
umn of Table U gives the results of the jet flavor fractions 
from the combined sample. Figure [3] shows the combined 
-Djljp distribution of &-tagged jets for data along with 



the fitted contributions from the light (NT data), c and 
b jets. 

The extracted jet flavor fractions are used to deter- 
mine the ratio cr(Z + b]et)/a(Z + jet) as follows: 



TABLE I: Jet flavor fractions obtained from template 
fitting in the dielectron, dimuon and combined 
channels, along with statistical uncertainties. 



Channel 



Combined 



Events 630 970 1600 

Z + b 0.248 ± 0.042 0.267 ± 0.036 0.259 ± 0.028 
Z + c 0.253 ± 0.073 0.364 ± 0.064 0.359 ± 0.049 
Z + light 0.499 ± 0.058 0.369 ± 0.049 0.382 ± 0.038 
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FIG. 3: (color online) The D^p discriminant distribut- 
ion of events in the combined sample. The distributions 
of the b, c, and light jets are weighted by the fractions 
found from the fit. Uncertainties are statistical only. 



cr(Z + 6jct) _ N b 
<j(Z+ jet) ~ N incl e^ 



(1) 



c b/i: 



where N- lnc i is the total number of Z + jet events before 
any tagging requirement, Nb is the number of Z + 6 jet 
events obtained from the Djlip e & aS * s * ne overall 
Djlip efficiency for b jets, which combines the efficiencies 
for taggability, NN tagger and -D^p selection, and e^;° cl 
accounts for the difference between b and inclusive jet 
reconstruction efficiencies. 

Several experimental uncertainties cancel out in the 
measurement of o(Z + b]et)/cr(Z + jet), including the 
uncertainties on the luminosity, and trigger, lepton, and 
some jet identification efficiencies. The two largest re- 
maining sources of systematic uncertainty are uncertain- 
ties in the -Djlip efficiency and in the shape of the -Djlip 
templates used for the extraction of the b jet fraction. 
Variation in -Dj^p efficiency by one standard deviation 
results in an uncertainty of 3.7% on the final result. The 
uncertainty due to the shape of the templates (4.2%) is 
estimated by using an alternate light jet template from 
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MC, by changing the b quark fragmentation function [ll|, 
and by varying the fraction of merged heavy quarks (bb, 
cc) inside the jet. Other important sources of uncertainty 
are the b tagging efficiency (2.4%), the b jet energy scale 
(2%), and reconstruction efficiency (3.2%). The total 
systematic uncertainty on the measurement of the ratio 
is 7.7%. The final result is 

aTzl^eti = 0.0f93±0.0022(stat)±0.0015(syst), (2) 

which is consistent with the ratios obtained separately 
for the two channels. This measurement is the most pre- 
cise to date. For the kinematic region considered in the 
analysis, an NLO MCFM [l[ prediction for the ratio yields 
0.0192 ± 0.0022; this is obtained for the renormalization 
and factorization scales Q% = Q 2 F = m 2 z (mz being the 
Z boson mass), and with the MSTW2008 PDFs 0. The 
prediction decreases by 3.6% when the effects from de- 
tector response, resolution as well as hadronization and 
underlying event are taken into account. 

In summary, we have performed the most precise mea- 



surement to date of the ratio of the cross section for Z 
boson production in association with at least one b jet to 
the inclusive Z+]et cross section, considering final states 
with Z —> ee and Z — > /i/i and jets with px > 20 GeV 
and 1 7/| < 2.5. The combined measurement of the ra- 
tio yields 0.0193 ± 0.0027, which is consistent with NLO 
QCD calculations. 
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